NEWS-20101014002401356-00220-00238-ia360913_meta.xml
NEWS-20101014002401356-00220-00238-ia360913
Internet Archive
Heritrix/3.1.1-SNAPSHOT-20100928.230034
newscrawl
Internet Archive
2010
Internet Archive crawldata from international news sites, captured by ia360913.us.archive.org:newscrawl from Thu Oct 14 00:24:01 UTC 2010 to Thu Oct 14 02:19:41 UTC 2010.
20101014002244
00220
http://www.archive.org/details/NEWS-20101014002401356-00220-00238-ia360913
20101014021941
20101014021941
00238
web
10
steve@archive.org
20101014002244
ia360913.us.archive.org
sanfrancisco
10737418240
Internet Archive
crawldata
News Crawldata 2010-10-14T00:24:01UTC to 2010-10-14T02:19:41UTC
news00000
newscrawl
webwidecrawl
2010-11-19 01:41:26
steve@archive.org
2010-11-19 01:41:26
Tue Dec 14 6:10:51 UTC 2010
144332
web
OL100013405
ia903607_11
NEWS-20101014003512100-00222-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "5fc7b6e07e9337b5b6bf9f357190b1aa"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1013952002
content-md5: 5fc7b6e07e9337b5b6bf9f357190b1aa
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:43:25.000Z
NEWS-20101014011534351-00228-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "8535b09a69d501e795836c8a9c6416cc"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1004209974
content-md5: 8535b09a69d501e795836c8a9c6416cc
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:45:55.000Z
NEWS-20101014020348709-00238-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "69204335526b28242802ff0c98dc4015"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1000701091
content-md5: 69204335526b28242802ff0c98dc4015
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:50:31.000Z
NEWS-20101014015152819-00236-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "655969d966a8339e3fde422d4d332640"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1006970518
content-md5: 655969d966a8339e3fde422d4d332640
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:49:40.000Z
NEWS-20101014014627197-00234-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "133b6fb24666243472a65b984b399784"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1005587889
content-md5: 133b6fb24666243472a65b984b399784
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:48:54.000Z
NEWS-20101014013448171-00232-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "73fb7824035bac3cb60309ba3d11c5f1"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1267376975
content-md5: 73fb7824035bac3cb60309ba3d11c5f1
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:47:51.000Z
NEWS-20101014002401356-00220-00238-ia360913_files.xml
Text
1290130893
1060
45766d5043a553f5189465d177824282
55e03c73
6bff65dbac82f7bbfaf3c63f4c9f1a563f3cb893
true
Metadata
1290130893
1676
bcedc6117f04eeed2b477af2527b655b
fa2287e7
39871abcc15293735473638f6e2121b80c651eed
1349493668
8563847
6cb4231099dcd029c5a0f0d65dc97c09
5afb647a
fd2577e931c6697481a0058756b79ae321890441
Item CDX Index
true
Metadata
ae3f927afb5a80068ba3356475992f7f
md5
Metadata
1464932283
1716
d24154fd47aaef63fe7662f7d40c531b
94594a60
bf132399201ccd65dd7f8dd7923175d06490f12c
Web ARChive GZ
1290130943
1009669883
79e343f1414a80c30793a0d3dec78ded
af34bf3a
2d0b58ed3a9a9d7a8f5b15bf1b2e5bb230eef999
true
Metadata
1290130943
389
34b5b9699b6c9b6ebe5d7d4c041613d1
d63c9cbf
42ad26d934ea7beaf6a22ad47c5ffe85a95d8afe
WARC CDX Index
NEWS-20101014002401356-00220-21264~ia360913.us.archive.org~9443.warc.gz
1339333823
870374
9914f808ded843e638ef968c2492d1e8
4e01a635
581cc6f90fd75454979ae5d697644ab3985a8f3f
true
Web ARChive GZ
1290131005
1013952002
5fc7b6e07e9337b5b6bf9f357190b1aa
5e5f0f9f
83865a7815ccdd621cfbf030fb48ef0d1b904688
true
Metadata
1290131005
389
eddf445af063d0c43d29c736834695a3
e3317b00
dcd7b9207161d0d06ec3dc81ae8dc12cb5cbd301
WARC CDX Index
NEWS-20101014003512100-00222-21264~ia360913.us.archive.org~9443.warc.gz
1339333485
714536
bbc00776b89aedafdd00e35fb67dd48f
be75daad
57d41906d22945894a030d01c369a6313f36b0f9
true
Web ARChive GZ
1290131069
1301734803
45ceb9a32f4f9f35e6c8efb252b80543
45882869
4e221fac9371462b93bbb7e01f5d7edd0bce6f56
true
Metadata
1290131069
389
7705088482b85d0f1cec7c02a83bbbfb
d4fcb399
375eecb6179c466a6fcf2aa34e1d3d5070ac2495
WARC CDX Index
NEWS-20101014004434631-00224-21264~ia360913.us.archive.org~9443.warc.gz
1339333610
856311
a3df99803c2168eb808cb59bb62cea4b
a4947ba0
4893dbbc8194e138e110411126a74720d45d89a8
true
Web ARChive GZ
1290131111
1001060049
edca7472ff8b4da7b812a1cf60f212ec
9a73dcaf
ea986579c6d1cc39968cb3a1539c3f78500db854
true
Metadata
1290131111
389
1354b3746854e8bc717ec8f9109c013b
1bbd37b2
dfbb7f3526bded4bd7d67d40f6598f622fa24a9b
WARC CDX Index
NEWS-20101014005500563-00226-21264~ia360913.us.archive.org~9443.warc.gz
1339333709
1663090
457ae8ca5dbf3160c01e304a976a094c
83c82d28
a07d68def957ed4152f2acc18dd1e9812adb8fd0
true
Web ARChive GZ
1290131155
1004209974
8535b09a69d501e795836c8a9c6416cc
85d4f8ac
c8899c9bd0363c50503e2d8a0fc634c6e11898fc
true
Metadata
1290131155
389
7db6cf8a5258d664ac801ac578917f16
ae03c142
e6e5cb40326d456d59981c80de01d3e6a72e9792
WARC CDX Index
NEWS-20101014011534351-00228-21264~ia360913.us.archive.org~9443.warc.gz
1339333416
535387
e441dbe079821b8f6c3ade1eab2d54ed
9e96206c
cab9da08b10a25aa28c971a6aae9297217d8649e
true
Web ARChive GZ
1290131208
1005378942
987cc10ccf34760f8622ce781a324c84
8351258c
2ea1aecab58eddf0afda143607f6a6fbbb3ce18e
true
Metadata
1290131208
389
c44bc5bcf36182b9fcfaa646b8593d12
b9f5ff39
7a8a391f34f08bcedcb7db11216c1917181b29b8
WARC CDX Index
NEWS-20101014012156510-00230-21264~ia360913.us.archive.org~9443.warc.gz
1339333337
1062726
c448afdd6ef6f9f7462c165da71a9f9c
62f5174e
880fe8d43dd12001130b647bb71c76c2a70131cd
true
Web ARChive GZ
1290131271
1267376975
73fb7824035bac3cb60309ba3d11c5f1
ae390f51
f0344de01c95acb72b6ca5e7fa6f7b622c2fb03f
true
Metadata
1290131271
389
678c0b1180b00c28bb739cdd9f08eed0
6b665698
11c6d58f101881957505564ff9731425614a789e
WARC CDX Index
NEWS-20101014013448171-00232-21264~ia360913.us.archive.org~9443.warc.gz
1339332973
909751
1122072300c171993646e59928e0c2b1
c356059c
8cd129c3af617ce7f2e2e1dc30ac5537b48ac10a
true
Web ARChive GZ
1290131334
1005587889
133b6fb24666243472a65b984b399784
6de7b206
49d256d9dad9b822b1f8d5f3b1442ab8b52069d2
true
Metadata
1290131334
389
f9d26ef6f1e98f48e24b6965ab16a022
8c86e684
dc5bd0c2c95b96768940e1322a38521e42e519de
WARC CDX Index
NEWS-20101014014627197-00234-21264~ia360913.us.archive.org~9443.warc.gz
1339333031
411936
36d700d8eb8c8fa38bd92882a58c7e37
3a602dbb
baafcc5cdd6d1260aef577487c8f1c1fb6220a64
true
Web ARChive GZ
1290131380
1006970518
655969d966a8339e3fde422d4d332640
1e800a6f
2cb4fb8dfb6481fe0756b1c22cfd6d6b20ac1d9f
true
Metadata
1290131380
389
495e7b27cecc0e527ce75cd3bcdf28f0
4b2ce37e
62371b67f5b992552a08171d289f8d4cd1b88065
WARC CDX Index
NEWS-20101014015152819-00236-21264~ia360913.us.archive.org~9443.warc.gz
1339333096
842210
e41d8661f3b8cf0505e8b4b7ed0014a5
23de5df4
017630f3b7e70b8826e6a4637561639053c46cd9
true
Web ARChive GZ
1290131431
1000701091
69204335526b28242802ff0c98dc4015
510ef81b
6d2e1f2758f8b395191659a8804c86439b5b86f9
true
Metadata
1290131431
389
bbd99f675e1a9fcf1eb275c57a260d73
22d791cb
5b5dbd0dbdc0d7dab3264ec10c856480e41918bd
WARC CDX Index
NEWS-20101014020348709-00238-21264~ia360913.us.archive.org~9443.warc.gz
1339333196
1198697
f5506364066aeb99d9f7699db3ada54f
6cf2a478
ee0aa89f5893470f31cad58676e02cc94b119fe2
true
NEWS-20101014012156510-00230-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "987cc10ccf34760f8622ce781a324c84"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1005378942
content-md5: 987cc10ccf34760f8622ce781a324c84
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:46:48.000Z
NEWS-20101014004434631-00224-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "45ceb9a32f4f9f35e6c8efb252b80543"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1301734803
content-md5: 45ceb9a32f4f9f35e6c8efb252b80543
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:44:29.000Z
NEWS-20101014002401356-00220-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "79e343f1414a80c30793a0d3dec78ded"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1009669883
content-md5: 79e343f1414a80c30793a0d3dec78ded
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:42:23.000Z
MANIFEST.txt_meta.txt
ETag: "45766d5043a553f5189465d177824282"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1060
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-amz-auto-make-bucket: 1
x-archive-meta-contributor: Internet Archive
x-archive-meta-crawler: Heritrix/3.1.1-SNAPSHOT-20100928.230034
x-archive-meta-crawljob: newscrawl
x-archive-meta-creator: Internet Archive
x-archive-meta-date: 2010
x-archive-meta-description: Internet Archive crawldata from international news sites, captured by ia360913.us.archive.org:newscrawl from Thu Oct 14 00:24:01 UTC 2010 to Thu Oct 14 02:19:41 UTC 2010.
x-archive-meta-firstfiledate: 20101014002401356
x-archive-meta-firstfileserial: 00220
x-archive-meta-identifier-access: http://www.archive.org/details/NEWS-20101014002401356-00220-00238-ia360913
x-archive-meta-lastdate: 20101014021941
x-archive-meta-lastfiledate: 20101014020348709
x-archive-meta-lastfileserial: 00238
x-archive-meta-mediatype: web
x-archive-meta-numwarcs: 10
x-archive-meta-operator: steve@archive.org
x-archive-meta-scandate: 20101014002401
x-archive-meta-scanner: ia360913.us.archive.org
x-archive-meta-scanningcenter: sanfrancisco
x-archive-meta-sizehint: 10737418240
x-archive-meta-sponsor: Internet Archive
x-archive-meta-subject: crawldata
x-archive-meta-title: News Crawldata 2010-10-14T00:24:01UTC to 2010-10-14T02:19:41UTC
x-archive-meta01-collection: news00000
x-archive-meta02-collection: newscrawl
x-archive-meta03-collection: webwidecrawl
x-archive-queue-derive: 0
x-archive-size-hint: 10737418240
x-upload-date: 2010-11-19T01:41:33.000Z
NEWS-20101014005500563-00226-21264~ia360913.us.archive.org~9443.warc.gz_meta.txt
ETag: "edca7472ff8b4da7b812a1cf60f212ec"
accept: */*
authorization: LOW ZHZ3zXxTzOJaSZCE:REDACTED_BY_IA_S3
connection: close
content-length: 1001060049
content-md5: edca7472ff8b4da7b812a1cf60f212ec
host: s3.us.archive.org
user-agent: curl/7.18.2 (x86_64-pc-linux-gnu) libcurl/7.18.2 OpenSSL/0.9.8g zlib/1.2.3.3 libidn/1.10
x-archive-queue-derive: 0
x-upload-date: 2010-11-19T01:45:11.000Z